CDS
Accession Number | TCMCG078C07365 |
gbkey | CDS |
Protein Id | KAG0460577.1 |
Location | join(3367595..3367831,3367910..3367978,3368056..3368193,3368315..3368432,3368689..3368771,3369269..3369364,3369991..3370159,3370238..3370380,3370453..3370539,3370690..3370875,3371553..3371720,3371925..3372045,3372117..3372186,3372362..3372449,3372570..3372690,3373824..3374017,3374102..3374210,3374649..3374804,3379406..3379508,3379873..3380046,3380188..3380333,3421719..3421918) |
Organism | Vanilla planifolia |
locus_tag | HPP92_020874 |
Protein
Length | 991aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA633886, BioSample:SAMN14973820 |
db_source | JADCNL010000011.1 |
Definition | hypothetical protein HPP92_020874 [Vanilla planifolia] |
Locus_tag | HPP92_020874 |
EGGNOG-MAPPER Annotation
COG_category | G |
Description | Belongs to the glycosyl hydrolase 31 family |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction |
R00028
[VIEW IN KEGG] R00801 [VIEW IN KEGG] R00802 [VIEW IN KEGG] R06087 [VIEW IN KEGG] R06088 [VIEW IN KEGG] |
KEGG_rclass |
RC00028
[VIEW IN KEGG] RC00049 [VIEW IN KEGG] RC00077 [VIEW IN KEGG] |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko01000 [VIEW IN KEGG] |
KEGG_ko |
ko:K01187
[VIEW IN KEGG] |
EC |
3.2.1.20
[VIEW IN KEGG]
[VIEW IN INGREDIENT] |
KEGG_Pathway |
ko00052
[VIEW IN KEGG] ko00500 [VIEW IN KEGG] ko01100 [VIEW IN KEGG] map00052 [VIEW IN KEGG] map00500 [VIEW IN KEGG] map01100 [VIEW IN KEGG] |
GOs | - |
Sequence
CDS: ATGGATGACGGGGTCGTGAACATGGAGAACAAAGCCGGTCGTATGGTATTCGAGCCTATCCTTGAGGAGGGGGTCTTTCGGTTCGATTGCTCGGGGACTGATCGCGCCGCCGCGTTTCCTAGCCTCTCCTTCGCCGATCCTAAGGTCAGGGAGACTCCTCTCGCTGTCCATGGAATGATCCCCGAGTTCGTCCCTGTCTTTGAGTGCGTTCATGGCCAGCAGAAGGTGCAGGTCCAGCTTCCTTTGGGGACATCTCTCTATGGAACTGGGGAAGTAAGCGGGCCGCTCGAGAGAACTGGAAAACGAATCTTTACATGGAACACGGATGCATGGGGTTATGGCCCCGGAACGACCTCCTTGTACCAGTCTCATCCTTGGGTTCTGGCTGTTTTTCCTGATGGGAAAAGCTTAGGTGTGCTTGCTGATACGACGAGGCGTTGTGAGATTGATCTCCGGGAGAATTCTACCATAAAGTTTGTATCTGCAGCTGTGTACCCTGTAATCACATTTGGCCCATTTGAGTCGCCTACTCGTGTCTTGATATCTTTGTCTCATGCAATAGGAACTGTTTTTATGCCTCCAAAATGGTCTCTTGGTTATCATCAATGCCGCTGGAGCTATGAGACTGATGCAAGAGTTCGTGAGGTGGCTACTAATTTTCTTGAAAGAGGCATACCTTGTGATGTTATATGGATGGACATTGACTACATGCATGGTTTCCGGTGCTTTACTTTTGATAAAGAGCGTTTTCCTGATCCGAAAGCTTTGGTGAATGACCTTCATGCCATGGGCATCAAAGCAATTTGGATGCTTGACCCTGGAATCAAACATGAGGAGGGTTATTTTGTTTATGAAAGTGGTTCCAAGCATAATTTATGGATCTTGAAGGAAGATGAGAATCTTTTTGTGGGGGATGTATGGCCAGGGCCTTGTGTGTTCCCAGATTTCACTAAGAAAGAAGCACGATTTTGGTGGGCTAATTTGGTAAAAGATTTTGTTTCTAATGGTGTTGATGGGATTTGGAATGATATGAATGAACCTGCTATTTTCAAAACGGTTACAAAAACGATGCCTGAAAGCAACATACACAGGGGAGATGCCGAACTTGGTGGTCGACAATCACACTCCCATTATCATAATGTATATGGCATGCTTATGGCAAGATCAACATATGAGGGAATGAAAATGGCTAATGAAGGAAAGCGTCCCTTTGTTCTCACTAGGGCTGGATTCATAGGAAGTCAGCGCTATGCTGCAACCTGGACCGGAGATAACTTGTCTAATTGGGAGCATCTGCATATGAGTGTGCCAATGGTTATTCAACTGGGTCTAAGTGGTCAGCCGTTATCAGGACCAGATATTGGTGGATTCGCTGGTAATGCAACTCCAAGGCTCTTTGGAAGATGGATGGGAGTGGGTGCCATGTTTCCATTTTGTCGTGGGCACTCTGAAGCTGGAACAATTGATCAGGAACCTTGGTCATTTGGAAAAGAGTGTGAAGAAATATGTCGATTGGCTATTTTAAGGCGGTCTAGGCTTATACCTCACATTTATACACTTTTCTATGAGGCCCATGCAAATGGAACTCCCATTATCTCGCCCACTTTTTTCGCTGATCCTAAGGACCAGAAATTGAGGAAAGTTGAAAATTCCTTTCTACTTGGATCACTTTTGGTTTGTGCAAGCACCATTCCTGAACGAGGATCACATGAATTATCCTTCACATTACCAGCTGGAACTTGGATGAGATTTGATTTTGATGATTCACATCCAGATTTGCCCATATTATTCTTGCAAGGAGGTTCAATACTTCCTGTGGGTCCTACTCTTCAGCATCTTGGTCAAGCTACTCGAACCGATGAGTTATCACTCTTTATAGCTTTAGACAAAAATGGTAAAGCTGAAGGAGTTTTGTTCGAGGATGATGGCGATGGTTATGGTTACACCCAGGGAGCCTATCTCTTGACCTACTATGCTGCAGCATTGAGCTCTTCTATTGTTACAGTGAGCATCTCCCGAACAGAAGGGTTGTGGAAGAGAGCCAATCGAAGTCTACATGTGCATGTCTTACTTGGTGGTGGAGCAATGGTAGAGGGTTGGGGAATTGATGGTGAAGAAGTGCAAATAACCATGCCTACAGAATCTGAGGTGTTTAACATGGCATCAGCAAGTGAAGCTCAACATAGGGAACGGATGGGTAAAGCTAAGCTTCTCCCAGATGCTGCTGCTATCTCTGGAAATAAGGGTTTTGAGCTATCCAAGACCCCTCTCGAGATCAAGGGTAGGGACTGGCTGCTTAAAGTGGTGCCATGGATTGGTGGTCGAATGATCTCCATGATACATCTTCCTTCAGCGACCCAGTGGCTTCACAGTAGGTTTGAAGCAGATGGATACGAAGAGTATAGCGGCATCGAATACAGATCTGCAGGATGCTCTGAAGAATATCAAGTTGTAGGGAGAAATCTCGAGCAGTCTGGGGAAGAAGAAGCTCTTACCCTAGAAGGAGATATTGGTGGTGGATTAGTGCTCCAACGCAGCATATTTATTCCTAAAGATGCTCCACAGATACTAGCGATATGTTCTCGCATAATAGCGCGAAATGTGGGTGCTGGCTCTGGTGGATTTTCAAGGATGGTTTGCTTGCGGGTGCACCCAACTTTTACCCTGTTGCATCCTGCCGAGGTGCTCGTTGTGTTCGACTCCATTGATGGCACAAAGCATGAGATCAGACCTGAAGCAGGAGAACAAACGTTGGAAGGAGATATCCTCCCTAATGGAGAATGGATGCTGGTTGACAAGTGCACGGGCCTGGGGCTTGTGAACAGATTTGATATCAACCAAGTGAACAAATGCATGATTCATTGGGGAAGTCGAACTGTTAATTTGGAGCTGTGGTCTGTAGAAAGGCCTGTTTCAGTGGAGACTCCCTTGGAGATTTCTCACGAATACGAGGTGAAGGAGGTGAACTTGTATTAG |
Protein: MDDGVVNMENKAGRMVFEPILEEGVFRFDCSGTDRAAAFPSLSFADPKVRETPLAVHGMIPEFVPVFECVHGQQKVQVQLPLGTSLYGTGEVSGPLERTGKRIFTWNTDAWGYGPGTTSLYQSHPWVLAVFPDGKSLGVLADTTRRCEIDLRENSTIKFVSAAVYPVITFGPFESPTRVLISLSHAIGTVFMPPKWSLGYHQCRWSYETDARVREVATNFLERGIPCDVIWMDIDYMHGFRCFTFDKERFPDPKALVNDLHAMGIKAIWMLDPGIKHEEGYFVYESGSKHNLWILKEDENLFVGDVWPGPCVFPDFTKKEARFWWANLVKDFVSNGVDGIWNDMNEPAIFKTVTKTMPESNIHRGDAELGGRQSHSHYHNVYGMLMARSTYEGMKMANEGKRPFVLTRAGFIGSQRYAATWTGDNLSNWEHLHMSVPMVIQLGLSGQPLSGPDIGGFAGNATPRLFGRWMGVGAMFPFCRGHSEAGTIDQEPWSFGKECEEICRLAILRRSRLIPHIYTLFYEAHANGTPIISPTFFADPKDQKLRKVENSFLLGSLLVCASTIPERGSHELSFTLPAGTWMRFDFDDSHPDLPILFLQGGSILPVGPTLQHLGQATRTDELSLFIALDKNGKAEGVLFEDDGDGYGYTQGAYLLTYYAAALSSSIVTVSISRTEGLWKRANRSLHVHVLLGGGAMVEGWGIDGEEVQITMPTESEVFNMASASEAQHRERMGKAKLLPDAAAISGNKGFELSKTPLEIKGRDWLLKVVPWIGGRMISMIHLPSATQWLHSRFEADGYEEYSGIEYRSAGCSEEYQVVGRNLEQSGEEEALTLEGDIGGGLVLQRSIFIPKDAPQILAICSRIIARNVGAGSGGFSRMVCLRVHPTFTLLHPAEVLVVFDSIDGTKHEIRPEAGEQTLEGDILPNGEWMLVDKCTGLGLVNRFDINQVNKCMIHWGSRTVNLELWSVERPVSVETPLEISHEYEVKEVNLY |